Multi-modal Fusion Framework with Particle Filter for Speaker Tracking

نویسندگان

  • Anwar Saeed
  • Ayoub Al-Hamadi
  • Michael Heuer
چکیده

In the domain of Human-Computer Interaction (HCI), the main focus of the computer is to interpret the external stimuli provided by users. Moreover in the multi-person scenarios, it is important to localize and track the speaker. To solve this issue, we introduce here a framework by which multi-modal sensory data can be efficiently and meaningfully combined in the application of speaker tracking. This framework fuses together four different observation types taken from multi-modal sensors. The advantages of this fusion are that weak sensory data from either modality can be reinforced, and the presence of noise can be reduced. We propose a method of combining these modalities by employing a particle filter. This method offers satisfied real-time performance. We demonstrate results of a speaker localization in twoand three-person scenarios.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Sequential Monte Carlo Fusion of Sound and Vision for Speaker Tracking

Video telephony could be considerably enhanced by provision of a tracking system that allows freedom of movement to the speaker, while maintaining a well-framed image, for transmission over limited bandwidth. Already commercial multi-microphone systems exist which track speaker direction in order to reject background noise. Stereo sound and vision are complementary modalities in that sound is g...

متن کامل

Integrating robust likelihoods with Monte-Carlo filters for multi-target tracking

In this paper, a dynamic multi-modal fusion scheme for tracking multiple targets with Monte-Carlo filters is presented, with the goal of achieving robustness by combining complimentary likelihoods based on color and foreground segmentation. The generality of the proposed approach allows defining the measurements on different levels (pixel-, featureand object-space) through dynamic data fusion. ...

متن کامل

A hybrid approach to 3D arm motion tracking

This paper presents a hybrid approach to 3D arm motion tracking for tele-rehabilitation applications. A particle filter (PF) algorithm is adopted in the proposed system to fuse data from inertial and visual sensors in a probabilistic manner. Multi-modal distributions of system states are propagated based on a 'factor sampling' technique. To avoid the problem of particle degeneracy in convention...

متن کامل

Multi-camera Tracking and Activity Recognition

This document describes the progress on the MUCATAR (MUltiple CAmera Tracking and Activity Recognition) IM2 White Paper Project during its second year. Building on the first year achievments on single-object tracking, the research during the second year moved into two main directions: 1) the investigation of new sampling strategies to improve tracking with particle filters, both for single and ...

متن کامل

Speaker Tracking Using Particle Filter Sensor Fusion

in Proc. of Asian Conference on Computer Vision (ACCV), 2004 Sensor fusion for object tracking has become an active research direction during the past few years. But how to do it in a robust and principled way is still an open problem. In this paper, we propose a new fusion framework that combines both the bottom-up and top-down approaches to probabilistically fuse multiple sensing modalities. ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012